Sequencing near infrared and infrared fluorescence labeled DNA for detecting using laser diodes and suitable labels therefor

ABSTRACT

To sequence DNA automatically, DNA marked with far infrared, near infrared, or infrared fluorescent dyes are electrophoresed in a plurality of channels through a gel electrophoresis slab or capillary tubes wherein the DNA samples are resolved in accordance with the size of DNA fragments in the gel electrophoresis slab or capillary tubes into fluorescently marked DNA bands. The separated samples are scanned photoelectrically with a laser diode and a sensor, wherein the laser scans with scanning light at a wavelength within the absorbance spectrum of said fluorescently marked DNA samples and light is sensed at the emission wavelength of the marked DNA.

REFERENCE TO RELATED APPLICATIONS

This application is a streamline divisional of application Ser. No. 08/204,627 filed Mar. 1, 1994, now U.S. Pat. No. 5,571,388 which application Ser. No. 08/204,627 is firstly a continuation-in-part of U.S. application Ser. No. 07/950,734 filed Sep. 24, 1992, now U.S. Pat. No. 5,346,603, which is a continuation of U.S. application Ser. No. 07/799,712 filed Nov. 26, 1991, now abandoned, which is a continuation of U.S. application Ser. No. 07/632,605 filed Dec. 24, 1990, now abandoned, which is a continuation of U.S. application Ser. No. 07/078,279 filed Jul. 27, 1987, now abandoned, which is a divisional of U.S. application Ser. No. 594,676 for DNA SEQUENCING filed by Middendorf et al. on Mar. 29, 1984, and assigned to the same assignee as this application, now U.S. Pat. No. 4,729,947; and which application Ser. No. 08/204,627 is secondly a continuation-in-part of U.S. application Ser. No. 07/860,140 filed Mar. 30, 1992, now U.S. Pat. No. 5,366,603, which is a divisonal of U.S. application Ser. No. 07/763,230 filed Sep. 20, 1991, now U.S. Pat. No. 5,230,781, which is a continuation-in-part of application Ser. No. 07/570,503, filed Aug. 21, 1990, now U.S. Pat. No. 5,207,880, which is a continuation-in-part of U.S. application Ser. No. 07/078,279 filed Jul. 27, 1987, now abandoned, which is a divisonal of U.S. application Ser. No. 594,676 for DNA SEQUENCING filed by Middendorf et al. on Mar. 29, 1984, and assigned to the same assignee as this application, now U.S. Pat. No. 4,729,947.

BACKGROUND OF THE INVENTION

This invention relates to the sequencing of fluorescence labeled DNA and the detecting of the DNA after irradiation by light from a laser, and suitable labels therefor.

In one class of techniques for sequencing DNA, identical strands of DNA are marked with fluorescent dye. The strands are marked by attaching specially synthesized fluorescent oligonucleotide primers or probes to the strands of DNA or by attaching the fluorescent dye directly to the strands.

The strands are separated into four aliquots. The strands in a given aliquot are either individually cleaved at or synthesized to any base belonging to only one of the four base types, which are adenine, guanine, cytosine and thymine (hereinafter A, G, C and T). The adenine-, guanine-, cytosine- and thymine- terminated strands are then electrophoresed for separation and the separated strands are irradiated by a laser and the emission detected. The rate of electrophoresis indicates the DNA sequence.

In a prior art sequencing technique of this class, the fluorescent dyes used as markers have their maximum emission spectra in the visible range, the DNA is subject to irradiation in the visible spectra, and visible spectra detectors and light sources are used. Generally photomultipliers tubes are used for detection.

The prior art techniques for DNA sequencing have several disadvantages such as: (1) because the dyes have their emission spectra in the visible region of light spectrum, the lasers used to excite the fluorescent markers, and under some circumstances, the detectors for the light tend to be expensive; and (2) they are relatively noisy due to the background interferences by biomolecules.

Cyanine dyes are known to absorb infrared and near infrared light and techniques for the synthesis of derivatives of the cyanine dyes are known. However, obtaining absorbance and emission bands that reduce the effect of background noise during gel electrophoresis apparatus when irradiating with a diode laser scanning has been difficult.

SUMMARY OF THE INVENTION

Accordingly, it is an object of the invention to provide a novel technique for DNA sequencing.

It is a further object of the invention to provide novel equipment and methods for the sequencing of near infrared and infrared fluorescence labeled DNA and the detection of the DNA after irradiation by light from laser diodes.

It is a still further object of the invention to provide novel dyes that fluoresce in the infrared or near infrared region.

It is a still further object of the invention to provide a new and novel method of synthesizing dyes having certain characteristics

It is a still further object of the invention to provide novel dyes that fluoresce in the infrared or near infrared region in selected wavelengths when attached to oligonucleotides and that have substantial quantum yield.

It is a still further object of the invention to provide a novel probe or primer that contains a dye that fluoresces in the infrared or near infrared region.

It is a still further object of the invention to provide a technique for continuous sequencing of DNA.

It is a still further object of the invention to provide a novel technique for continuously sequencing DNA using fluorescent detection.

It is a still further object of the invention to provide a novel technique for DNA sequencing using a novel fluorescent marker fastened to the DNA.

It is a still further object of the invention to provide a novel technique for continuously sequencing DNA marked with fluorescence which more clearly distinguishes marked DNA fragments from background fluorescence.

It is a still further object of the invention to provide a novel fluorescent marker, method of synthesizing the marker and of attaching it to DNA.

It is a further object of the invention to provide a novel method, marker and equipment that permits the detection of marked molecules, including DNA, using diode light sources and detectors.

It is a further object of this invention to provide novel fluorescent dyes.

In accordance with the above and further objects of the invention, the sequencing of near infrared and infrared fluorescent labeled DNA and the detection of the DNA after irradiation by infrared or near infrared light from a laser diode is accomplished using a novel label. The strands of DNA are continuously electrophoresed and identified for any of several purposes, such as for example: (1) DNA sequencing; and (2) analysis of strands varying in length as prepared by such techniques as restriction enzyme cutting or polymerase chain reaction (PCR).

To aid in identification, the strands are marked with fluorescent labels that emit light in the infrared or near infrared region. The strands are irradiated with light in the infrared or near infrared region and the light emitted from the fluorescent labels is detected and used to obtain information about the DNA strands.

The marking is by direct labeling of fluorescent markers to the strands or by fluorescently labeled probes or primers hybridized to the separated strands. In one embodiment, the strands are detected by scanning with an infrared or near infrared laser diode light source.

The scanning apparatus includes a laser diode that emits light of a wavelength near or in the optimum absorption spectrum of the marker. In the preferred embodiment, the light source is a diode laser that irradiates channels of DNA strands with infrared light having a wavelength that matches the absorbance region of the marker. The detector includes a light sensor which is preferably an avalanche photodiode sensitive to the near infrared light emission of the marker. It may include a filtering system having a pass band suitable for passing selectively the optimum emission of the fluorescent marker to the light sensor.

The photodiode, photomultiplier or other light detector selectively detects the fluorescense, using techniques which enhance the signal/noise ratio. One technique is to modulate the laser source by pulsing the electrical current driving the laser, with detection through a lock-in amplifier. The detection is made in a wavelength band including the high emission spectrum of the fluorescent marker.

To mark the DNA strand, a novel dye is synthesized by a novel route having the desired properties or a known dye is modified. In the preferred embodiment, the novel dye has a preferred spectrum, has high absorption and fluorescence properties and has at least one reactive group enabling coupling to biomolecules such as DNA, proteins, and antibodies.

The dye is synthesized, modified from a known dye or selected to have an absorbance band and an emission band within a region encompassing the near infrared and infrared regions when attached to a probe, primer, oligonucleotide or other molecule. The dye should provide high quantum yield in a optical band selected to reduce background noise. The preferred dyes for many applications calling for the labeling biomolecules are cyanine dyes having a NCS group on the dye that may react with the amino goup of the biomolecule to form a thiourea linkage or having NHS esters or hydroxyl groups for the purpose of reacting with a biomolecule to form linkages.

The preferred dyes are penta methine or heptamethine cyanines which absorb light having wavelengths in the region of 630 to 900 nm. This wavelength is suitable for reducing background fluorescence in DNA sequencing and corresponds to the radiation wavelength of the GaAlAs diode laser which, for example, is 780 nm. The GaAlAs diode is used for scanning the gel electrophoresis sandwich using the DNA sequencing.

In another embodiment, the infrared or near infrared dyes are selected to provide at least four dyes with each dye having its excitation and/or radiation spectra spaced sufficiently from each other dye so the fluorescence from each dye can be distinguished from each other dye either by the wavelength that excites it into fluorescence or the wavelength at which it fluoresces or both. The wavelength spacing is maintained sufficiently close to be excited by laser diodes. The dyes may be incorporated in probes and primers for attachment to oligonucleotides by such methods for example as described Ruth, Jerry L. (1984) DNA 3, 123.

From the above summary, it can be understood that the techniques for the sequencing of fluorescence labeled DNA of this invention have several advantages, such as: (1) because the dyes have their emission spectra in the infrared or near infrared light spectrum, small inexpensive infrared diode lasers may be used; and (2) they are characterized by relatively low noise.

SUMMARY OF THE DRAWINGS

The above noted and other features of the invention will be better understood from the following detailed description when considered with reference to the accompanying drawings in which:

FIG. 1 is a perspective view of an embodiment of sequencer usable in the invention;

FIG. 2 is a sectional view taken through lines 2--2 of FIG. 1;

FIG. 3 is a sectional view of a portion of FIG. 1 taken through lines 3--3;

FIG. 4 is an exploded perspective view of a portion of the embodiment of FIG. 2;

FIG. 5 is an enlarged view, partly broken away, of a portion of the embodiment of FIG. 2; and

FIG. 6 is a block diagram of a circuit that may be used for coordination of a sensor, scanner drive and laser used.

DETAILED DESCRIPTION

The sequencing of infrared fluorescence labeled DNA and the detection of the DNA after irradiation by infrared light from a laser diode is accomplished using an infrared label prepared for this purpose and either directly attached to the DNA or attached to probes or primers that will be attached to the DNA. In this specification the word "infrared" will be used at times to include near infrared wavelengths (700-900 nm) infrared (630-700 nm) and far infrared (900-3000 nm). The strands of DNA are continuously electrophoresed and identified for any of several purposes, such as for example: (1) DNA sequencing; and (2) analysis of strands varying in length as prepared by such techniques as restriction enzyme cutting or polymerase chain reaction (PCR).

The strands are marked with fluorescent labels that have their maximum fluorescense and their maximum absorbance at wavelengths of light in the infrared and near infrared region. The strands are irradiated with light in the infrared or near infrared region from a laser diode and the light emitted from the fluorescent labels is detected and used to obtain information about the DNA strands. The detector includes a light sensor which is preferably an avalanche photodiode sensitive to the near infrared light emission of the marker. It may include a filtering system having a pass band suitable for passing selectively the optimum emission of the fluorescent marker to the light sensor.

To mark the DNA strand, a dye is prepared having the desired properties or a known dye is modified. In the preferred embodiment a novel dye having the preferred spectrum, high absorption and fluorescence properties, and at least one reactive group enabling coupling to biomolecules such as DNA, proteins, and antibodies is synthesized.

The dye is synthesized, modified from a known dye or selected to have an absorbance band and an emission band within a region encompassing the near infrared and infrared regions when attached to a probe, primer, oligonucleotide or other molecule. The dye should provide high quantum yield in a band selected to reduce background noise. The preferred dyes for many applications calling for the labelling biomolecules are cyanine dyes having a NCS group on the dye that may react with the amino goup of the biomolecule to form a thiourea linkage.

In the preferred embodiment, cyanine dyes are synthesized. The preferred dyes are heptamethine cyanines which efficiently absorb light having wavelengths in the region of 750 to 820 nm (nanometers) (maximum absorbance wavelength). This wavelength is suitable for reducing background fluorescence in DNA sequencing and corresponds to the radiation wavelength of the GaAlAs diode laser which is 780 nm. The GaAlAs diode is used for irradiating the gel electrophoresis sandwich used for DNA sequencing. Formulas 1-16 are typical synthesized or proposed dyes and formula 17 is a suitable modified dye.

Formula 1 shows synthesized cyanine dyes having NCS as a reactive group for attachment of a biomolecule. In this embodiment, when X is H, the maximum absorbance wavelength is between 787 and 801 nm, depending on the solvent and the maximum emission wavelength is between 807 and 818 depending on the solvent. When X is --OCH₃, the maximum absorbance wavelength is 786 nm and the maximum emission wavelength is 806 nm. The quantum yield is high.

Formula 2 shows synthesized cyanine dyes some of which have a high quantum yield. For example, when R is ethyl or sulfonatobutyl and n is either 1 or 2, the maximum absorbance wavelength is between 762 and 778 nm. Depending on the solvent the maximum emission wavelength is between 782 and 800 nm.

In the synthesized dyes represented by formula 3, when R is SO₃ Na, the maximum absorbance wavelength is between 773 and 778 nm and the maximum emission wavelength is between 789 and 806 nm. The dye contains a functional hydroxy group to permit coupling to DNA strands for the purposes of sequencing DNA.

As shown by formula 5, a still further increase in quantum yield is obtained by coupling the nitrogen groups on the two sides of the heterocyclic base by a polymethylene group including a number of methylene monomers chosen to maintain the distance between them. Generally, the methylene chain is between four and ten groups long, with nine being preferable. Similarly, ethers may be used with the same number of groups. The ethers are more stable. A functional group may be attached to provide convenient coupling. This arrangement has the proper spectrum and higher quantum yield.

A proposed series of molecules represented by formula 6 is expected to have increased solubility to permit its use in an aqueous medium through the use of an amino group or ether group attached to the benzene ring. In this series, X may be oxygen or NH, Y may be NCS or H, and R may be H, NCS, CH₂ OH, CH₂ NCS or COOH according to tables 1 or 2.

In formula 7: R may be ethyl, 4-sulfonatobutyl, 3-amino propyl phthalimidobutyl, hexyl and pentyl carboxylates; Z may be S, O, CMe₂, Y may be H, SO₃ Na, SO₃ Et₃ NH, OCH₃, NO₂ CO₂ Na or CO₂ Et₃ NH and X may be H, N(CH₂ CO₂ CH₃)₂, CH₂ CH₂ OH, CHS, CH₂ CH₂ CH₂ 0H or --(CH₂)n--OH where the number n=any number between and including 8-12. ##STR1##

                  TABLE 1                                                          ______________________________________                                         Dye formula 6 series, when X = 0                                               X              Y           R                                                   ______________________________________                                         0              NCS         H                                                   0              H           NCS                                                 0              H           CH.sub.2 OH                                         0              H           CH.sub.2 NCS                                        0              H           COOH                                                ______________________________________                                    

                  TABLE 2                                                          ______________________________________                                         Dye formula 6 series, when X = NH                                              X              Y           R                                                   ______________________________________                                         NH             NCS         H                                                   NH             H           NCS                                                 NH             H           CH.sub.2 OH                                         NH             H           CH.sub.2 NCS                                        NH             H           COOH                                                ______________________________________                                    

In the series of proposed molecules represented by formula 8:

(1) When X and R are hydrogen, the absorbance wavelength is 767 nanometers and the emission wavelength is 787 nanometers, a high quantum yield.

(2) When X is equal to MeO and R is hydrogen, the absorbance wavelengths are slightly higher such as between 767 and 769 nanometers and the emission wavelength is slightly greater but the yield is slightly reduced.

(3) When X is a nitro group and R is hydrogen, the absorbance wavelength increases as well as the emission wavelength, with the absorbance wavelength being between 770 and 774 nanometers and emission wavelength being between 792 and 798 wavelengths with a high quantum yield.

(4) When X is NCS and R is hydrogen, the absorbance and emission wavelengths drop to between 768 and 769 nanometers for the absorbance wavelength and 788 nanometers for the emission wavelength with a quantum yield that is high.

(5) When X is NCS and R is OM the absorbance maximum wavelength is between 790 and 796 nm and the maximum emission wavelength is 814 nm.

(6) When X is C₃ H₆ OH and R is hydrogen, the maximum absorbance wavelength is between 766 and 768 nm and the maximum emission wavelength is between 788 and 790 nm.

(7) When X is NCS and R is SO₅ Na, the maximum absorbance wavelength is between 773 and 778 nm and the maximum emission wavelength is between 789 and 806 nm.

(2) When R is H, and X is --CH₂ CH₂ CH₂ --OH (hydropropyl) the aborbance wavelength is slightly reduced to between 766 and 768 nanometers and the emission wavelength is reduced to between 788 and 790 nanometers but the quantum yield is increased.

(3) When R is SO Na and X is NCS, the absorbance wavelength is reduced to 773 to 778 nanometers and the emission wavelength is reduced slightly to 789 to 806 nanometers. It produces a high quantum yield. ##STR2##

In the series of molecules of formula 9:

(1) When X is NCS the absorbance wavelength is between 764 and 770 nanometers and the emission wavelength is between 786 and 802 nanometers.

(2) When X is CH₂ CH₂ CH₂ OH the absorbance wavelength is between 762 and 770 nanometers and the emission wavelength is between 782 to 790 nanometers.

The series of molecules represented by formula 10 have been synthesized and are effective as IR labels when R1, R2 and X are as shown in table 3.

The series of molecules represented by formula 11 have been synthesized and are effective as IR labels when R₁ and X are as shown in table 4.

The series of molecules represented by formula 12 have been synthesized and are effective as IR labels when R₁, R₂ and X are as shown in table 5.

The series of molecules represented by formula 13 have been synthesized and are effective as IR labels when R₁, R₃ and R₂ are as shown in table 6.

The series of molecules represented by formula 14 have been synthesized and are effective as IR labels when R and X are as shown in table 7. ##STR3##

                  TABLE 3                                                          ______________________________________                                         R1               R2          X                                                 ______________________________________                                         --C.sub.4 H.sub.8 SO.sub.3 (Na)                                                                 H           NCS                                               --C.sub.4 H.sub.8 SO.sub.3 (Na)                                                                 H           OCH.sub.3                                         --C.sub.4 H.sub.8 SO.sub.3 (Na)                                                                 H           NO.sub.2                                          --C.sub.4 H.sub.8 SO.sub.3 (Na)                                                                 H           H                                                 --C.sub.2 H.sub.5 (1)                                                                           H           OCH.sub.3                                         --C.sub.2 H.sub.5 (1)                                                                           OCH.sub.3   OCH.sub.3                                         --C.sub.4 H.sub.8 SO.sub.3 (Na)                                                                 OCH.sub.3   OCH.sub.3                                         --C.sub.4 H.sub.8 SO.sub.3 (Na)                                                                 OCH.sub.3   NCS                                               --C.sub.3 H.sub.8 NCS(Br)                                                                       H           H                                                 C.sub.2 H.sub.5 I                                                                               H           NCS                                               C.sub.4 H.sub.8 SO.sub.3 Na                                                                     SO.sub.3 Na NCS                                               C.sub.2 H.sub.5 (I)                                                                             SO.sub.3 Na NCS                                               ______________________________________                                    

                  TABLE 4                                                          ______________________________________                                         R.sub.1               X                                                        ______________________________________                                         --C.sub.2 H.sub.5 (1) OPhNCS                                                   --C.sub.4 H.sub.8 SO.sub.3 (Na)                                                                      OPhNCS                                                   --C.sub.3 H.sub.8 NCS(Br)                                                                            OPh                                                      phthalimide butyl     OPhNCS                                                   ______________________________________                                    

                  TABLE 5                                                          ______________________________________                                         R.sub.1        R.sub.2       X                                                 ______________________________________                                         --C.sub.3 H.sub.6 NCS(Br)                                                                     --C.sub.4 H.sub.8 SO.sub.3 H                                                                 C1                                                --C.sub.3 H.sub.6 NCS(Br)                                                                     --C.sub.2 H.sub.5                                                                            C1                                                --C.sub.3 H.sub.6 NCS                                                                         --C.sub.4 H.sub.8 SO.sub.3                                                                   OPh                                               --C.sub.3 H.sub.6 NCS(Br)                                                                     --C.sub.2 H.sub.5                                                                            OPh                                               C.sub.3 H.sub.6 NCS                                                                           C.sub.4 H.sub.8 SO.sub.3                                                                     PPh(pMeO)                                         ______________________________________                                    

                  TABLE 6                                                          ______________________________________                                         R.sub.1         R.sub.3     R.sub.2                                            ______________________________________                                         Et(I);          H           H                                                  (CH.sub.2).sub.4 SO.sub.3 (Na);                                                                H           H                                                  Et(I);          COOCH.sub.3 H                                                  Et(I)           COOCH.sub.3 --(CH).sub.4 --                                    (CH.sub.2).sub.4 SO.sub.3 (Na)                                                                 COOCH.sub.3 --(CH).sub.4 --                                    ______________________________________                                    

                  TABLE 7                                                          ______________________________________                                                R         X                                                             ______________________________________                                                Et(I.sup.-)                                                                              O                                                                    (CH.sub.2).sub.4 SO.sub.3.sup.- (Na.sup.+)                                               O                                                                    Et(I.sup.-)                                                                              S                                                                    (CH.sub.2).sub.4 SO.sub.3.sup.- (Na.sup.+)                                               S                                                             ______________________________________                                    

                  TABLE 8                                                          ______________________________________                                                    R                                                                   ______________________________________                                                    Et(I.sup.-)                                                                    (CH.sub.2).sub.4 SO.sub.3.sup.- (Na.sup.+)                          ______________________________________                                          ##STR4##                                                                  

In another embodiment, the infrared dyes are selected to provide at least four dyes with each dye having its excitation and/or radiation spectra spaced sufficiently from each other dye so the fluorescence from each dye can be distinguished from each other dye either by the wavelength that excites it into fluorescence or the wavelength at which it fluoresces or both. The spacing is maintained sufficiently close to be excited by laser diodes. The dyes may be incorporated in probes and primers for attachment to oligonucleotides as described in Ruth, Jerry L. (1984) DNA 3, 123. The

There are many dyes suitable for such modification such as for example: (1) 3,3'-Diethylthiadicarbocyanine Iodide; (2) 3,3'-Diethylthiatricarbocyanine Perchlorate; (3) 3,3' Diethyloxatricarbocyanine Iodide; (4) 1,1',3,3,3'-Hexamethylindotricarbocyanine Perchlorate; (5) 1,1'-Diethyl-2,2'-dicarbocyanine Iodide; (6) 3,3'-Diethylthiadicarbocyanine Iodide; (7) 3,3'-Diethyloxatricarbocyanine Iodide; (8) 1,1',3,3,3'1,3'-Hexamethylindotricarbocyanine Perchlorate; (9) 1,1',3,3,3',3'-Hexamethylindotricarbocyanine Iodide; and (10) Indocyanine Green.

In the one embodiment, the dye has the formula shown in formula 17, with R being --CH₂ --CH₃. This dye is close to having the desired wavelength of maximum fluorescence and the wavelength of maximum absorbance may be modified by changing the functional group R. The unmodified dye may be obtained from Laboratory and Research Products Division, Eastman Kodak Company, Rochester, N.Y. 14650. It is advertised in the Kodak laser dyes, Kodak publication JJ-169.

The modifications with 1,3-propanediol can be made in a manner known in the art as illustrated by equation 1. For example, changes occur when different esters are formed replacing the ethyl alcohol in the original dye molecule (R equal --CH₂ --CH₃ of formula 17). If different glycol esters are formed, absorption maxima of these new near infrared dyes shift to the longer wavelengths. Moreover, new dyes may be synthesized rather than modifying existing dyes in a manner known in the art.

The absorption maximum is dependent on the distance of the O atoms in the glycol functional group. However, the fluorescence maxima of these new near infrared dyes are practically at same wavelength of the dye of formula 17, i.e. 819 nm. This indicates that only the excitation process has changed, i.e. to what energy level the transition occurs. The lowest vibronic level of first excited state remains unchanged. The absorption maxima of several such esters are: (1) ethylene glycol 796 nm (nanometers); (2) 1,3-Propanediol 780 nm; (3) 1,4-Butanediol 754 nm; (4) 1,6-Hexanediol 744 nm; (5) Triethylene glycol (#4) 790 nm; and (6) IR-144 (R═CH₂ --CH₃) 742 nm.

In the preferred embodiment, the fluorescence maximum wavelength is about 819 nanometers and the detector is adjusted to receive this wavelength and not others by appropriate filtering. The absorption maxima is selected to be different and to correspond ##STR5## to the preferred available laser diode emission. For example, in formula 17, R may be any of the following four groups, depending on the desired wavelength of the diode emitted light, which are:

(1) --CH₂ --CH₂ --OH for an absorption wavelength of 796 nanometers;

(2) --CH₂ --CH₂ ----CH₂ --OH for an absorption wavelength of 780 nanometers, which is the preferred embodiment;

(3) --CH₂ --CH₂ --CH₂ --CH₂ --CH₂ --CH₂ --OH for an absorption wavelength of 745 nanometers;

(4) --CH₂ --CH₂ --O--CH₂ --CH₂ --O--CH₂ --O--CH₂ --OH for an absorption wavelength of 790 nanometers; and

(5) --CH₂ --CH₂ --SH for an emission wavelength of 810 nanometers.

Dye synthesis is illustrated by chart 1, which shows the synthesis of formulas 7, 8, 9, and 10. In the chloro intermediate in chart 1, R and R₁ may have the values shown in table 9. Chart 2 shows the synthesis of formula 11. Chart 3 shows the synthesis of formulas 1 and 12. Chart 4 shows the synthesis of formula 13. Chart 5shows the synthesis of formula 14. Chart 6 shows the synthesis of formula 15. Chart 7 shows the

                  TABLE 9                                                          ______________________________________                                                R              Y                                                        ______________________________________                                                C.sub.4 H.sub.8 SO.sub.3 (H)                                                                  H                                                               C.sub.2 H.sub.5                                                                               H                                                               C.sub.3 H.sub.6 NH.sub.2 HBr                                                                  H                                                               C.sub.4 H.sub.8 SO.sub.3 (H)                                                                  SO.sub.3 NC                                                     C.sub.2 H.sub.5                                                                               SO.sub.3 NC                                                     C.sub.4 H.sub.8 SO.sub.3 (H)                                                                  OCH.sub.3                                                       C.sub.2 H.sub.5                                                                               OCH.sub.3                                                ______________________________________                                    

synthesis of formula 16. In these charts X is a halogen. ##STR6##

In FIG. 1, there is shown a perspective view of an embodiment of sequencer in which the method of this invention may be performed. This sequencer is described in structure and operation in the aforementioned U.S. patent application Ser. No. 07/570,503 filed Aug. 21, 1990; U.S. patent application Ser. No. 07/078,279 filed Jul. 27, 1987; and U.S. Pat. No. 4,729,947, all of which are entitled DNA SEQUENCING and which were filed by Middendorf et al. on Mar. 29, 1984.

In FIG. 2, there is shown a sectional view of the remote station 122A taken through section lines 2--2 of FIG. 1 having an electrophoresis section 140, a scanning section 142, an electrophoresis power supply 144, a system power supply section 144A, an analog board 146 and a digital board 148. The electrophoresis section 140 is positioned near the front of the cabinet and a portion of it is adapted to be scanned by the scanning section 142 in cooperation with circuitry on the analog board 146 and the digital board 148. All of the apparatus are electrically connected to the power supply section 144A for such operation.

To separate different DNA fragments into bands, the electrophoresis section 140 includes a gel sandwich 150, an upper buffer assembly 152, support assembly 154, and a lower buffer assembly 151 positioned to enclose the bottom of the gel sandwich 150. In the embodiment of FIG. 2, the gel sandwich 150 is held substantially vertically and its temperature is controlled during operation. Bands are separated by applying voltage to the upper buffer assembly 152 and lower buffer assembly 151 and scanned by the scanning section 142.

To support the gel sandwich 150, the support assembly 154 includes a pair of upper side brackets and lower side brackets 160 and 162 (only one of each pair being shown in FIG. 2), a temperature control heating plate 164, and a plastic spacer, shown at 166A-166C, in FIG. 2. The entire structure is supported on the support assembly 154 which mounts the upper and lower side brackets 160 and 162.

The upper and lower side brackets 160 and 162 are shaped to receive the gel sandwich 150 and hold it in place in juxtaposition with the scanning section 142. The spacer as shown as 166A-166C space the temperature control heating plate 164 from an apparatus support plate 168 and maintain it at a constant selected temperature above ambient temperature. In the preferred embodiment, the temperature is maintained at 50 degrees Centigrade and should be maintained in a range of 30 degrees to 80 degrees.

The scanning section 142 includes a laser diode assembly (not shown in FIG. 2), a microscope assembly 172, a photodiode section 174 and a scanner mounting section 176. The laser diode assembly (not shown in FIG. 2) is positioned at an angle to an opening in the heating plate 164 so that light impinges on the gel sandwich 150 to cause fluorescence with minimum reflection back through the microscope assembly 172.

To receive the fluorescent light, the microscope assembly 172 is focused on the gel sandwich 150 and transmits fluorescent light emitted therefrom into the photodiode section 174 which converts it to electrical signals for transmission to and processing by the analog and digital boards 146 and 148 which may provide further analysis of data. The scanning section 142 moves along a slot in the apparatus support plate 168 which is mounted to the scanner mounting section 176 during this operation in order to scan across the columns in the gel sandwich 150.

The scanner mounting section 176 includes a mounting plate 180, a bearing 182, a stepping motor 184, a slidable support 186 and a belt and pulley arrangement 185, 188, 188A. The mounting plate 180 is bolted to the apparatus support plate 168 and supports an elongated bearing plate 182, a stepping motor 184 and two pulleys 188 and 188A. The elongated bearing plate 182 extends the length of the gel sandwich 150.

To permit motion of the laser diode assembly (not shown) and microscope assembly 172 with respect to the gel sandwich 150, the slidable support 186 supports the microscope assembly 172 and diode assembly and slidably rests upon the bearing plate 182. An output shaft 183 of the stepping motor 184 drives a pulley 188B through pulley 188, belt 185, and pulley 188A and the pulley 188B drives a belt (not shown) that is clamped to the slidable support 186 to move it the length of the gel sandwich 150 during scanning by the laser diode and microscope assembly 172 which rest upon it. The stepping motor 184 under the control of circuitry in the digital board 148 moves the pulley 188B to move the belt (not shown) and thus cause scanning across the gel sandwich 150.

As shown in this view, the electrophoresis power supply 144 is electrically connected to buffer in the upper buffer assembly 152 through an electrical connector 194 and to the lower buffer assembly 151 through a connector not shown in FIG. 2.

The upper buffer assembly 152 includes walls 197 forming a container to hold a buffer solution 195 and a cover 199 formed with a lip to fit over the walls 197 from the top and containing a downwardly extending flat member spaced away from the side walls and holding a conductor 211. The conductor 211 is electrically connected to the source of power through connector 194 which is mounted to the top of the cover 199 to permit electrical energization of the buffer solution 195.

The bottom buffer assembly 151 includes enclosed walls 201 defining a container for holding a buffer solution 203 and a cap 205 closing the container 201 and having a downwardly extending portion 213 extending into the buffer 203 for supporting a conductor 207 for applying energy to the bottom buffer solution 203. The gel sandwich 150 extends downwardly into the buffer solution 203 and upwardly into the buffer solution 195 to permit the electrical contact for electrophoresis.

In FIG. 3, there is shown a sectional view taken through lines 3--3 of FIG. 1 showing the electrophoresis section 140, the scanning section 142 and the electrophoresis power supply section 144 mounted together to illustrate from a top view the arrangement of the apparatus support plate 168, the heater plate 164, the gel sandwich 150, a microscope assembly 172 and a photodiode assembly 174. The heater plate 164 and apparatus support plate 168 have slots running in a horizontal direction orthogonal to the lanes of DNA in the electrophoresis section 140 sized to receive the ends of a laser diode assembly 170 and the microscope section 172 for scanning thereof.

To cooperate with the separation and scanning of DNA bands, the gel sandwich 150 includes a front glass plate 200, a gel section 202 and a rear glass plate 204 mounted in contact with the heater plate 164 and having a section exposed for scanning by the laser diode assembly 170 and the microscope assembly 172. The rear glass plate 204 contacts the heater plate 164 and is separated from the front plate 200 by the gel section 202 within which DNA separation takes place. The front and rear glass plates may be any type of glass but are preferably soda lime which has low fluorescence in the infrared and near infrared regions and is prepared by a process that provides optically flat surfaces without grinding.

To transmit light to the gel sandwich 150, the laser diode assembly 170 includes a housing 210, a focusing lens 212, a narrow band pass filter 214, a collimating lens 216 and a laser diode 218. The laser diode 218 emits infrared or near infrared light which is collimated by the laser collimating lens 216 and filtered through the narrow band pass infrared filter 214. This light is focused by the focusing lens 212 onto the gel sandwich 150. Preferably, the point of focus on the gel section 202 of the gel sandwich 150 lies along or near the central longitudinal axis of the microscope section 172 and the photodiode section 174.

The thickness of the glass plates and the gel, the position of the laser and sensor and their angle of incidence are chosen, taking into consideration the refractive index of the gel and glass so that the light from the laser is absorbed by a maximum number of markers for one channel. The light from the laser is not directly reflected back because the angle of incidence to normal is equal to the Brewster's angle at the first interface and is such as to impinge on the markers with full intensity after refraction but not be reflected by subsequent layers of the gel sandwich 150 into the sensor and the sensor views a large number of markers that fluoresce in a line of sight of substantial concentration.

To maintain temperature control over the laser diode, the housing 210: (a) is coupled to a heat sink through a thermal electric cooler 220, and (b) encloses the focusing lens 212, narrow band pass filter 214, collimating lens 216 and laser diode 218; and (c) accommodates the electrical leads for the diode. To receive and focus light emitted by fluorescent markers from the gel section 202 in response to the light from the laser diode assembly 170, the microscope assembly 172 includes a collection lens 230, a housing 232 and a coupling section 234. The microscope assembly 172 is adapted to be positioned with its longitudinal axis centered on the collection lens 230 and aligned with the photodiode section 174 to which it is connected by the coupling section 234. For this purpose, the housing 232 includes a central passageway in which are located one or more optical filters with a band pass matching the emission fluorescence of the marked DNA strands along its longitudinal axis from the axis of the collection lens 230 to the coupling section 234 which transmits light to the photodiode section 174. With this arrangement, the collection lens 230 receives light from the fluorescent material within the gel section 202 and collimates the collected light for optical filtering and then transmission to the photodiode assembly 174.

To generate electrical signals representing the detected fluorescence, the photodiode assembly 174 includes a housing 240 having within it, as the principal elements of the light sensors, an inlet window 242, a focusing lens 244, a sapphire window 246 and an avalanche photodiode 248. To support the avalanche photodiode 248, a detector mounting plate 250 is mounted within the housing 240 to support a plate upon which the avalanche photodiode 248 is mounted. The inlet window 242 fits within the coupling section 234 to receive light along the longitudinal axis of the photodiode assembly 174 from the microscope assembly 172.

Within the housing 240 of the photodiode assembly 174, the sapphire window 246 and avalanche photodiode 248 are aligned along the common axis of the microscope assembly 172 and the photodiode assembly 174 and focuses light transmitted by the microscope assembly 172 onto a small spot on the avalanche photodiode 248 for conversion to electrical signals. A thermoelectric cooler 252 utilizing the Peltier effect is mounted adjacent to the detector mounting plate 250 to maintain a relatively cool temperature suitable for proper operation of the avalanche photodiode 248.

The lower buffer assembly 151 (FIG. 2) includes outer walls 201 and a bottom wall forming a compartment for buffer solution which encloses the bottom of the gel sandwich 150.

As best shown in this view, the stepping motor 184 rotates the belt 185 to turn the pulley 188A, which, in turn, rotates pulley 188B. The pulley 188B includes a belt 177 extending between it and an idler pulley 179 and attached at one location to the slideable support 186 to move the scanning microscope and laser lengthwise along the gel sandwich 150 for scanning purposes. The motor 184, by moving the carriage back and forth, accomplishes scanning of the gel sandwich 150.

In FIG. 4, there is shown a fragmentary perspective view of the gel sandwich 150 and the upper buffer assembly 152 mounted to each other showing the outer glass plate 200 cut away from the rear glass plate 204 to expose the gel section 202 to buffer solution within the upper buffer assembly 152. With this arrangement, samples may be pipetted between the glass plates 200 and 204 and moved downwardly by electrophoresis beyond the upper buffer assembly 152 and through the gel sandwich 150 to the bottom buffer (not shown in FIG. 4).

In FIG. 5, there is shown a broken away view of the gel sandwich 150 illustrating the upper buffer assembly 152 and the lower buffer assembly 151 connected to it at each end. As shown in this view, the cover 199 includes a connecting post 214 which receives the conductor 211 for connection to the downwardly extending portion of the cover 199 into the buffer compartment. Between the glass plates 200 and 204 (FIG. 4) of the gel sandwich 150, are a plurality of downwardly extending recesses 221 in the gel section 202 (FIG. 4) between the plates. DNA sample is pipetted into these recesses to form channels for electrophoresing to the lower buffer assembly 151.

To form an electrical connection through the gel sandwich 150 from the upper buffer assembly 152 to the lower buffer assembly 151, a connecting post 218 is connected to the cover 205 of the lower buffer assembly 151 for receiving the conductor 207 which extends downwardly to the downwardly extended plate 213 and into the buffer solution.

In FIG. 6, there is shown a block diagram of the circuitry used to control the remote station 122A of the embodiment of FIG. 2 having a control, correlation and readout section 250, the scanner drive 176, the motor assembly 184 for moving the scanner drive 176, and the sensing configuration $252. The sensing configuration 252 includes the laser assembly 170 and the sensor assembly 174 which receives signals, removes some noise, and transmits the signals for display and read out in the control, correlation and read out section 250, while the scanner drive 176 and motor for the scanner drive 184 receive signals from the control, correlation and read out section 250 to control the motion of the sensor back and forth across the gel sandwich. This overall configuration is not part of the invention of this application except insofar as it cooperates with the sensing configuration 252 to scan the DNA and determine its sequence in accordance with the embodiments of FIGS. 1-5.

To drive the sensor 174 from position to position, the motor assembly 184 includes a stepper motor 254 and a motor driver 256. The motor driver 256 receives signals from the control correlation and read-out section 250 and actuates the stepper motor 254 to drive the scanner drive 176. The scanner drive 176 is mechanically coupled to a stepping motor 254 through a belt and pulley arrangement for movement back and forth to sense the electrophoresis channels on the gel sandwich 150 (FIG. 3). The stepping motor 254 and driver circuitry 256 are conventional and not themselves part of the invention.

The control, correlation and read out system 250 includes a computer which may be any standard microprocessor 260, a television display or cathode ray tube display 262 and a printer 264 for displaying and printing the results of the scans.

To sense data, the sensing configuration 252 includes, in addition to the laser 170 and the sensor 174, a chopper circuit 270, a sensor power supply 272, a preamplifier 274, a lock-in amplifier 276, a 6-pole filter 278, a 12-bit analogue to digital converter interface circuit 280 and a laser power supply 282.

The sensor 174 receives light from the laser 170 after it impinges upon the gel sandwich 150 (FIG. 3) and transmits the signals through preamplifier 274 to the lock-in amplifier 276. The sensor receives signals from the sensor power supply 272. The chopper circuit 270 provides pulses at synchronized frequencies to the lock-in amplifier 276.

The laser 170 receives power from the power supply 282 which is controlled by the chopper circuit 270 so that the signal from the laser is in synchronism with the signal applied to the lock-in amplifier 276 so that the output from the lock-in amplifier 276 to the 6-pole filter 278 discriminates against unwanted signal frequencies. This signal is converted to a digital signal in the 12-bit analogue to digital converter 280 which serves as an interface to the computer 260.

With this arrangement, the scanning rate may be set to discriminate against noise and the synchronized demodulation from the chopper control further reduces noise, particularly discriminating against the natural fluorescense of the glass in the gel sandwich 150 (FIGS. 2 and 3).

From the above summary, it can be understood that the techniques for the sequencing of fluorescence labeled DNA of this invention have several advantages, such as: (1) because the dyes have their emission spectra in the infrared or near infrared light spectrum, small inexpensive infrared diode lasers may be used; and (2) they are characterized by relatively low noise.

Although a preferred embodiment of the invention has been described with some particularity, many modifications and variations are possible in the preferred embodiment within the light of the above description. Accordingly, within the scope of the appended claims, the invention may be practiced other than as specifically described. 

What is claimed is:
 1. A method of identifying strands of DNA comprising the steps of:marking the strands with fluorescent labels that emit light in a region of wavelengths including at least one wavelength within the infrared and near infrared region wherein the fluorescent label includes a chromophore having the formula: ##STR7## wherein X consists of one of O or NH, Y consists of one of NCS or H; and R consists of one of H, NCS, CH₂ OH, CH₂ NCS, COOH, irradiating the strands with light having a wavelength within one of the infrared and near infrared regions; and detecting the light emitted from the fluorescent labels.
 2. A method of identifying strands of DNA in accordance with claim 1 in which the step of marking includes the steps of attaching fluorescent markers to probes and hybridizing the probes to the strands.
 3. A method of identifying strands of DNA in accordance with claim 2 in which the step of detecting includes the step of detecting by irradiating with an infrared laser diode light source.
 4. A dye comprising: ##STR8## where X is (CH₂)_(n) ; n=4-10 or X is --CH₂ --CH₂ --O--CH₂ --CH₂ --O--CH₂ --CH₂ --.
 5. A dye comprising: ##STR9## wherein X is one of O or NH; Y is one of NCS or H; R is one of H, NCS, CH₂ OH, CH₂ NCH or COOH. 